Search CORE

22 research outputs found

The InterPro protein families and domains database: 20 years on

Author: Bateman A
Blum M
Bork P
Bridge A
Chang H-Y
Chuguransky S
Finn RD
Gough J
Grego T
Haft DH
Kandasaamy S
Letunic I
Marchler-Bauer A
Mi H
Mitchell A
Natale DA
Necci M
Nuka G
Orengo CA
Pandurangan AP
Paysan-Lafosse T
Qureshi M
Raj S
Richardson L
Rivoire C
Salazar GA
Sigrist CJA
Sillitoe I
Thanki N
Thomas PD
Tosatto SCE
Williams L
Wu CH
Publication venue
Publication date: 06/11/2020
Field of study

The InterPro database (https://www.ebi.ac.uk/interpro/) provides an integrative classification of protein sequences into families, and identifies functionally important domains and conserved sites. InterProScan is the underlying software that allows protein and nucleic acid sequences to be searched against InterPro's signatures. Signatures are predictive models which describe protein families, domains or sites, and are provided by multiple databases. InterPro combines signatures representing equivalent families, domains or sites, and provides additional information such as descriptions, literature references and Gene Ontology (GO) terms, to produce a comprehensive resource for protein classification. Founded in 1999, InterPro has become one of the most widely used resources for protein family annotation. Here, we report the status of InterPro (version 81.0) in its 20th year of operation, and its associated software, including updates to database content, the release of a new website and REST API, and performance improvements in InterProScan

UCL Discovery

The InterPro protein families database: the classification resource after 15 years.

Author: Attwood T.K.
Bateman A.
Bork P.
Chang H.Y.
Daugherty L.
Finn R.D.
Fraser M.
Gough J.
Guyot D.
Haft D.
Huang H.
Hunter S.
Kahn D.
Letunic I.
Lopez R.
McAnulla C.
McMenamin C.
Mi H.
Mitchell A.
Natale D.A.
Nuka G.
Oates M.
Orengo C.
Pesseat S.
Punta M.
Rato C.
Redaschi N.
Rivoire C.
Sangrador-Vegas A.
Scheremetjew M.
Sigrist C.J.
Sillitoe I.
Thomas P.D.
Wu C.H.
Xenarios I.
Yong S.Y.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2015
Field of study

The InterPro database (http://www.ebi.ac.uk/interpro/) is a freely available resource that can be used to classify sequences into protein families and to predict the presence of important domains and sites. Central to the InterPro database are predictive models, known as signatures, from a range of different protein family databases that have different biological focuses and use different methodological approaches to classify protein families and domains. InterPro integrates these signatures, capitalizing on the respective strengths of the individual databases, to produce a powerful protein classification resource. Here, we report on the status of InterPro as it enters its 15th year of operation, and give an overview of new developments with the database and its associated Web interfaces and software. In particular, the new domain architecture search tool is described and the process of mapping of Gene Ontology terms to InterPro is outlined. We also discuss the challenges faced by the resource given the explosive growth in sequence data in recent years. InterPro (version 48.0) contains 36 766 member database signatures integrated into 26 238 InterPro entries, an increase of over 3993 entries (5081 signatures), since 2012

Serveur académique lausannois

The University of Manchester - Institutional Repository

InterPro in 2017-beyond protein family and domain annotations

Author: Attwood TK
Babbitt PC
Bateman A
Bork P
Bridge AJ
Chang HY
Dosztányi Z
El-Gebali S
Finn RD
Fraser M
Gough J
Haft D
Holliday GL
Huang H
Huang X
Letunic I
Lopez R
Lu S
Marchler-Bauer A
Mi H
Mistry J
Mitchell AL
Natale DA
Necci M
Nuka G
Orengo CA
Park Y
Pesseat S
Piovesan D
Potter SC
Rawlings ND
Redaschi N
Richardson L
Rivoire C
Sangrador-Vegas A
Sigrist C
Sillitoe I
Smithers B
Squizzato S
Sutton G
Thanki N
Thomas PD
Tosatto SC
Wu CH
Xenarios I
Yeh LS
Young SY
Publication venue
Publication date: 29/11/2016
Field of study

InterPro (http://www.ebi.ac.uk/interpro/) is a freely available database used to classify protein sequences into families and to predict the presence of important domains and sites. InterProScan is the underlying software that allows both protein and nucleic acid sequences to be searched against InterPro's predictive models, which are provided by its member databases. Here, we report recent developments with InterPro and its associated software, including the addition of two new databases (SFLD and CDD), and the functionality to include residue-level annotation and prediction of intrinsic disorder. These developments enrich the annotations provided by InterPro, increase the overall number of residues annotated and allow more specific functional inferences

UCL Discovery

Recommended from our members

InterPro in 2019: improving coverage, classification and access to protein sequence annotations

The InterPro database (http://www.ebi.ac.uk/interpro/) classifies protein sequences into families and predicts the presence of functionally important domains and sites. Here, we report recent developments with InterPro (version 70.0) and its associated software, including an 18% growth in the size of the database in terms on new InterPro entries, updates to content, the inclusion of an additional entry type, refined modelling of discontinuous domains, and the development of a new programmatic interface and website. These developments extend and enrich the information provided by InterPro, and provide greater flexibility in terms of data access. We also show that InterPro's sequence coverage has kept pace with the growth of UniProtKB, and discuss how our evaluation of residue coverage may help guide future curation activities

Apollo (Cambridge)

Archivio istituzionale della ricerca - Università di Padova

InterPro in 2019: improving coverage, classification and access to protein sequence annotations

Author: Attwood TK
Babbitt PC
Blum M
Bork P
Bridge A
Brown SD
Chang H-Y
El-Gebali S
Finn RD
Fraser MI
Gough J
Haft DR
Huang H
Letunic I
Lopez R
Luciani A
Madeira F
Marchler-Bauer A
Mi H
Mitchell AL
Natale DA
Necci M
Nuka G
Orengo C
Pandurangan AP
Paysan-Lafosse T
Pesseat S
Potter SC
Qureshi MA
Rawlings ND
Redaschi N
Richardson LJ
Rivoire C
Salazar GA
Sangrador-Vegas A
Sigrist CJA
Sillitoe I
Sutton GG
Thanki N
Thomas PD
Tosatto SCE
Yong S-Y
Publication venue
Publication date: 06/11/2018
Field of study

UCL Discovery

Gene Ontology Consortium: going forward

Author: Aleksander SA
Argasinska J
Argoud-Puy G
Arighi C
Attrill H
Auchincloss A
Axelsen K
Bahler J
Balakrishnan R
Basu S
Bateman A
Bely B
Berardini TZ
Binkley G
Blake JA
Blatter MC
Bonilla C
Bougueleret L
Boutet E
Breuza L
Bridge A
Britto R
Brown NH
Burgess S
Buza T
Campbell NH
Carbon S
Casals C
Chan J
Chang HY
Cherry JM
Chibucos MC
Chisholm RL
Christie KR
Cibrian-Uhalte E
Costanzo MC
Coudert E
Cusin I
D'Eustachio P
Demeter J
Denny P
Dietze H
Dodson RJ
Dolan ME
Done J
Drabkin HJ
Duek-Roggli P
Dwight SS
Dwinell M
Engel SR
Estreicher A
Famiglietti L
Feuermann M
Fey P
Finn R
Foulger RE
Fraser M
Gane P
Garmiri P
Gaudet P
Giglio MG
Gos A
Gresham C
Gruaz-Gumowski N
Harris MA
Hatton-Ellis E
Hayman GT
Hill DP
Hinz U
Hitz BC
Howe D
Hu JC
Huala E
Hulo C
Humphries SE
Huntley R
Inglis DO
Jungo F
Keller G
Kersey PJ
Kishore R
Laiho K
Laulederkind S
Lemercier P
Lewis SE
Li D
Li Y
Lieberherr D
Lloyd P
Lock A
Lomax J
Lovering RC
MacDougall A
Magrane M
Martin M
Masson P
Matthews L
McCarthy F
McDowall MD
McIntosh BK
Mi H
Mitchell A
Miyasato SR
Muller HM
Mungall CJ
Munoz-Torres MC
Muruganujan A
Mutowo P
Nash RS
Ni L
Nuka G
O'Donovan C
Oliver SG
Osumi-Sutherland D
Parkinson H
Paskov K
Pedruzzi I
Pesseat S
Petri V
Pichler K
Pillai L
Poggioli D
Poudel S
Poux S
Renfro DP
Rivoire C
Roe G
Roechert B
Roncaglia P
Rutherford K
Sangrador A
Sawford T
Scheremetjew M
Schneider M
Shimoyama M
Shypitsyna A
Siegele DA
Simison M
Sitnikov D
Skrzypek MS
Staines DM
Stephan R
Sternberg PW
Stutz A
Sundaram S
Talmud PJ
Thomas PD
Tognolli M
Tweedie S
Van Auken K
Wang H
Wang SJ
Weng S
Westerfield M
Wong ED
Wood V
Wu C
Xenarios I
Young SY
Publication venue: OXFORD UNIV PRESS
Publication date: 28/01/2015
Field of study

The Gene Ontology (GO; http://www.geneontology.org) is a community-based bioinformatics resource that supplies information about gene product function using ontologies to represent biological knowledge. Here we describe improvements and expansions to several branches of the ontology, as well as updates that have allowed us to more efficiently disseminate the GO and capture feedback from the research community. The Gene Ontology Consortium (GOC) has expanded areas of the ontology such as cilia-related terms, cell-cycle terms and multicellular organism processes. We have also implemented new tools for generating ontology terms based on a set of logical rules making use of templates, and we have made efforts to increase our use of logical definitions. The GOC has a new and improved web site summarizing new developments and documentation, serving as a portal to GO data. Users can perform GO enrichment analysis, and search the GO for terms, annotations to gene products, and associated metadata across multiple species using the all-new AmiGO 2 browser. We encourage and welcome the input of the research community in all biological areas in our continued effort to improve the Gene Ontology

UCL Discovery